Porting Statistical Parsers with Data-Defined Kernels

نویسندگان

  • Ivan Titov
  • James Henderson
چکیده

Previous results have shown disappointing performance when porting a parser trained on one domain to another domain where only a small amount of data is available. We propose the use of data-defined kernels as a way to exploit statistics from a source domain while still specializing a parser to a target domain. A probabilistic model trained on the source domain (and possibly also the target domain) is used to define a kernel, which is then used in a large margin classifier trained only on the target domain. With a SVM classifier and a neural network probabilistic model, this method achieves improved performance over the probabilistic model alone.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Early Experiences Porting Scientific Applications to the Many Integrated Core (MIC) Platform

This paper presents experiences using an early development environment release of the forthcoming Intel MIC platform, focusing on porting of existing scientific applications and micro-kernels. Fortran and C++ applications are chosen from disciplines including quantum mechanics, hypersonics, rarefied gas dynamics, finite-element analysis, and FFT and linear algebra kernels used in the direct num...

متن کامل

Gpu Acceleration of the Long-wave Rapid Radiative Transfer Model in Wrf Using Cuda Fortran

This paper presents the approach and results of porting the Long-Wave Rapid Radiative Transfer Model (RRTM) component of the Weather Research and Forecast (WRF) code to the GPU using CUDA Fortran. After a brief description of the RTTM code, considerations regarding porting the application to the GPU are discussed. Included in the porting discussion are how the data structures have been modified...

متن کامل

Information Diffusion Kernels

A new family of kernels for statistical learning is introduced that exploits the geometric structure of statistical models. Based on the heat equation on the Riemannian manifold defined by the Fisher information metric, information diffusion kernels generalize the Gaussian kernel of Euclidean space, and provide a natural way of combining generative statistical modeling with non-parametric discr...

متن کامل

A Pattern Language for Porting Micro - kernelsMichel

Micro-kernels are diicult to port to a new hardware platform. During the initial phases of a port, much time and eeort is lost on debugging critical machine-dependent subsystems. These subsystems are generally very tightly coupled and cannot be tested in an incremental fashion. Tight coupling occurs because the subsystems share many global variables forcing them to be debugged with the complete...

متن کامل

Patterns to Ease the Port of Micro-kernels in Embedded Systems

Micro-kernels are diicult to port to a new hardware platform. During the initial phases of a port, much time and eeort is lost on debugging critical machine-dependent subsystems. These subsystems are generally very tightly coupled and cannot be tested in an incremental fashion. Tight coupling occurs because the subsystems share many global variables forcing them to be debugged with the complete...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006